A Hybrid On-line Topic Groups Mining Platform

نویسندگان

  • Cheng-Lin Yang
  • Yun-Heh Chen-Burger
چکیده

In recent years, there is a rapid increased use of social networking platforms in the forms of short-text communication. Such communication can be indicative to popular public opinions and may be influential to real-life events. It is worth to identify topic groups from it automatically so it can help the analyst to understand the social network easily. However, due to the short-length of the texts used, the precise meaning and context of such texts are often ambiguous. In this paper, we proposed a hybrid framework, which adapts and extends the text clustering technique that uses Wikipedia as background knowledge. Based on this method, we are able to achieve higher level of precision in identifying the group of messages that has the similar topic.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A review of text mining approaches and their function in discovering and extracting a topic

Background and aim: Four text mining methods are examined and focused on understanding and identifying their properties and limitations in subject discovery. Methodology: The study is an analytical review of the literature of text mining and topic modeling.  Findings: LSA could be used to classify specific and unique topics in documents that address only a single topic. The other three text min...

متن کامل

Issues for On-Line Analytical Mining of Data Warehouses

Data warehouses and OLAP engines are expected to be widely available in the near future. The data in data warehouses has been cleansed, integrated, and preprocessed, and infrastructures have been built surrounding data warehouses for e cient data analysis. Therefore, data warehouses or OLAP databases are expected to be a major platform for data mining in the future. We discuss the issues relate...

متن کامل

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

A Hybrid Mining Approach to Facilitate Health Insurance Decision: Case Study of Non-Traditional Data Mining Applications in Taiwan NHI Databases

This study examines time-sensitive applications of data mining methods to facilitate claims review processing and provide policy information for insurance decision-making vis-à-vis the Taiwan National Health Insurance databases. In order to obtain the best payment management, a hybrid mining approach, which has been grounded on the extant knowledge of data mining projects and health insurance d...

متن کامل

Sentimental Analysis of Twitter Data using Text Mining and Hybrid Classification Approach

Opinion Mining is an important concept in today’s world and due to the advent of social media it has become a huge source of database. Since almost everybody in the modern era is involved with some social media platform, the public mood is hugely reflected in the social media today. This thesis proposes to utilize this source of information and predict the sentiments of public towards a particu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015